LazyPIM: Efficient Support for Cache Coherence in Processing-in-Memory Architectures

نویسندگان

  • Amirali Boroumand
  • Saugata Ghose
  • Minesh Patel
  • Hasan Hassan
  • Brandon Lucia
  • Nastaran Hajinazar
  • Kevin Hsieh
  • Krishna T. Malladi
  • Hongzhong Zheng
  • Onur Mutlu
چکیده

Since 2014, I am a Ph.D. student in the department of Electrical and Computer Engineering at Carnegie Mellon University, advised by Professor Phillip B. Gibbons and Professor Onur Mutlu. I am interested in research problems that lie in the intersection of machine learning, distributed systems, and computer architecture. My current research focus is on distributed machine learning systems and near-data processing architectures.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Enabling the Adoption of Processing-in-Memory: Challenges, Mechanisms, Future Research Directions

Performance improvements from DRAM technology scaling have been lagging behind the improvements from logic technology scaling for many years. As application demand for main memory continues to grow, DRAM-based main memory is increasingly becoming a larger system bottleneck in terms of both performance and energy consumption. A major reason for poor memory performance and energy efficiency is me...

متن کامل

Structured Parallel Programming and Cache Coherence in Multicore Architectures

It is clear that multicore processors have become the building blocks of today’s high-performance computing platforms. The advent of massively parallel singlechip microprocessors further emphasizes the gap that exists between parallel architectures and parallel programming maturity. Our research group, starting from the experiences on distributed and shared memory multiprocessor, was one of the...

متن کامل

University of Delaware Department of Electrical and Computer Engineering Computer Architecture and Parallel Systems Laboratory A New Cache Protocol Based On The Order Free Consistency Memory Model

Computer architects are now studying a new generation of chip architectures that may integrate hundreds of processing cores and memory banks on a single chip with novel interconnect technologies. A key challenge lies in the design and development of an efficient on-chip shared memory organization for these future many-core architectures. New approaches need to be developed to address this chall...

متن کامل

Cache-Coherent Distributed Shared Memory: Perspectives on Its Development and Future Challenges

Distributed shared memory is an architectural approach that allows multiprocessors to support a single shared address space that is implemented with physically distributed memories. Hardwaresupported distributed shared memory is becoming the dominant approach for building multiprocessors with moderate to large numbers of processors. Cache coherence allows such architectures to use caching to ta...

متن کامل

Comparative Modeling and Evaluation of CC-NUMA and COMA on Hierarchical Ring rchitectures

Parallel computing performance on scalable share& memory architectures is affected by the structure of the interconnection networks linking processors to memory modules and on the efficiency of the memory/cache management systems. Cache Coherence Nonuniform Memory Access (CC-NUMA) and Cache Only Memory Access (COMA) are two effective memory systems, and the hierarchical ring structure is an eff...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1706.03162  شماره 

صفحات  -

تاریخ انتشار 2016